ABSTRACT 


Companies are always looking for ways to keep their professional personnel on 
board in order to save money on hiring and training. Predicting whether or not a 
specific employee would depart will assist the organisation in making proactive 
decisions. Human resource problems, unlike physical systems, cannot be defined by a 
scientific-analytical formula. As a result, machine learning approaches are the most 
effective instruments for achieving this goal. In this study, a feature selection strategy 
based on a Machine Learning Classifier is proposed to improve classification accuracy, 
precision, and True Positive Rate while lowering error rates such as False Positive Rate 
and Miss Rate. Different feature selection techniques, such as Information Gain, Gain 
Ratio, Chi-Square, Correlation-based, and Fisher Exact test, are analysed with six 
Machine Learning classifiers, such as Artificial Neural Network, Support Vector 
Machine, Gradient Boosting Tree, Bagging, Random Forest, and Decision Tree, for the 
proposed approach. In this study, combining Chi-Square feature selection with a 
Gradient Boosting Tree classifier improves employee attrition classification accuracy 
while lowering error rates. 
1. INTRODUCTION 


Employee attrition is also known as employee turnover. Attrition is a common problem, and it
is increasingly prevalent in today's industry. For the vast majority of organisations, it is
one of the most important difficulties [1]. Employee attrition refers to the gradual reduction
in the number of employees due to retirement, resignation, or death. Attrition rates vary
widely from industry to industry, each according to its own standards, and these rates can also
differ between skilled and unskilled positions. Organisations face the daunting task of
recruitment and talent retention, while also dealing with the loss of talent through continual
attrition, whether because of industry downturns or voluntary turnover [2]. When a well-trained
and well-adjusted employee quits the company, a void is created. As a result, the organisation
loses important skills, knowledge, and business relationships [29]. Managers and personnel
executives are therefore keenly interested in reducing attrition in a way that contributes to
the organisation's maximum viable growth and progress. Employee retention is, in any case,
essential to any business. If the situation is not handled properly, departures of key
personnel can result in a significant loss of earnings. Employee turnover causes performance
losses, which can have a long-term negative influence on businesses [3][4]. With attrition a
significant concern for every industry, businesses endeavour to use innovative business methods
to improve retention [5]. Although attrition cannot be eliminated completely, it can be reduced
by implementing appropriate solutions. It also helps if a manager can estimate the rate of
employee turnover ahead of time [30]. 


Machine learning can be used to classify labelled data or to discover hidden structure in
unlabelled data. Top-level management can use the ability of machine learning algorithms to
predict the likelihood of a person leaving the organisation [6][7]. This helps to control the
factors that cause attrition and to prevent it. Employee turnover is a significant issue for
employers, and every organisation's ability to retain talent is critical. If management can
obtain a predicted likelihood of employee separation, as well as the variables driving the
separation, it can take actions that reduce attrition risk [8][9]. This is where machine
learning is useful: top management can take proactive actions to retain personnel based on the
forecasts made by machine learning algorithms [10]. 


2. RELATED WORKS 


Tang, Ziyuan, Gautam Srivastava, and Shuai Liu [11] offered a strategy for selecting accounting
models for small and medium-sized enterprises (SMEs) based on accounting market big data
(AMBD). To begin, indicators of a company's solvency, operating capacity, profitability, and
growth capacity are chosen, such as the current ratio, quick ratio, asset-liability ratio, and
accounts receivable turnover rate. Following that, the AMBD constraints are classified using
principal component analysis. Finally, the optimal accounting model is established iteratively
by combining particle swarm optimization with ant colony optimization. 

Marichelvam, M. K., M. Geetha, and Ömür Tosun [12] took the effect of human factors into
account when solving the multi-stage hybrid flow shop scheduling problem with identical
parallel machines at each stage. The objective is to minimise the weighted sum of the makespan
and the total flow time. Because the problem is NP-hard, the authors propose an improved
version of the particle swarm optimization (PSO) algorithm to solve it. A dispatching rule and
a constructive heuristic are used to improve the PSO algorithm's initial solutions. The
variable neighbourhood search (VNS) algorithm is combined with the PSO algorithm to obtain the
best results in the shortest amount of time. 


Jhaver, Mehul, Yogesh Gupta, and Amit Kumar Mishra [13] investigated the use of the Gradient
Boosting technique, which is more robust due to its regularisation formulation; this is the
study's novel contribution. Gradient Boosting is compared with three commonly used supervised
classifiers, namely Logistic Regression, Support Vector Machine, and Random Forest, using data
from a global retailer, to show that it has higher accuracy for forecasting staff turnover. 


Machado, Marcos Roberto, Salma Karray, and Ivaldo Tributino de Sousa [14] presented a
financial company's deployment of a machine learning model to predict customer loyalty. The
researchers assessed the accuracy of two Gradient Boosting Decision Tree models: XGBoost and
LightGBM, the latter of which had not previously been used to predict customer loyalty. 

Keshri, Rajat, and P. Srividya [15] explored the LightGBM algorithm, which Microsoft published
in 2017. The authors compared LightGBM with other established classification algorithms on
their dataset and demonstrated LightGBM's excellent prediction accuracy. 


Padmasini, Ms, and K. Shyamala [16] presented a model that clusters Original Equipment
Customers, Distributors, and Dealers in order to determine what the consumer thinks about the
products and the firm based on pricing, quality, and delivery logistics, and hence whether the
consumer decides to buy an automobile product and become a loyal client. The Integrated Gower
based PSO-KMode model uses the Gower dissimilarity measure and optimises the data using
Particle Swarm Optimization with the K-Modes clustering method, assessing existing customers
through a survey. 


Eitle, Verena, and Peter Buxmann [17] proposed a model to assist software sales reps in 
managing the complicated sales funnel. Data-driven qualification assistance decreases the high 
degree of arbitrariness produced by professional expertise and experiences by incorporating 
business analytics in the form of machine learning into lead and opportunity management. 
Using real business data from the company's CRM system, the authors created an artefact 
consisting of three models to map the end-to-end sales pipeline. 


3. PROPOSED FRAMEWORK FOR EMPLOYEE ATTRITION PREDICTION 


3.1. Artificial Neural Network Classifier 


The artificial neural network is modelled on the biological neuron [18]. Consider first how a
single neuron learns. Figure 1 depicts a single neuron with a single input, where $O$ is the
output, $\sigma$ is the sigmoid function, $\xi$ is the neuron's input, and $\omega$ is the
weight that connects the input to the neuron. 





Figure 1 Single input neuron 


When a neuron has many inputs, each input is weighted and connected to the neuron. Layers of
such neurons form a multilayer perceptron (MLP), as shown in Figure 2. 

Figure 2 Multilayer perceptron 

$O = \sigma(\xi_1\omega_1 + \xi_2\omega_2 + \cdots + \xi_k\omega_k) + \theta$


where $O$ is the output, 
$\sigma$ is the sigmoid function or transfer function, 
$\xi$ is the input to the neuron, 
$\omega$ is the weight of the input (1 to k), and 
$\theta$ is the bias. 


3.2. Support Vector Machine Classifier 


A support vector machine (SVM) is a classifier defined by a separating hyperplane. The output
of the method is an optimal hyperplane that categorises new instances. In a two-dimensional
space, this hyperplane is a line that divides the plane into two halves, with each class on one
side. It produces good results when dealing with more complex classification problems. Each
data element is plotted as a point in n-dimensional space, with the value of each feature being
the value of a particular coordinate. The SVM [19] is a highly effective classifier that
separates the two groups. 





Figure 3 Support Vector Machine hyperplane classifying two classes 


3.3. Gradient Boosting Classifier 


Gradient boosting (GB) builds new models sequentially from a set of weak models, with each new
model minimising the loss function. The loss function is minimised by gradient descent. Each
new model fitted in this way matches the observations better and improves overall accuracy.
Boosting must, however, be stopped at some point; otherwise, the model tends to overfit. The
stop criterion could be a certain level of accuracy or a certain number of models. 
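
The sequential procedure can be illustrated with a minimal sketch. It assumes squared-error loss (whose negative gradient is simply the residual), one-feature regression stumps as the weak models, and a fixed number of models as the stop criterion. The function names, the learning rate, and the toy data are illustrative assumptions and are not taken from this study.

```python
def fit_stump(xs, residuals):
    """Best single-threshold regression stump for one feature (squared error).

    Assumes xs contains at least two distinct values.
    """
    best = None
    for t in sorted(set(xs))[:-1]:  # thresholds that leave both sides non-empty
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        lv, rv = sum(left) / len(left), sum(right) / len(right)
        err = sum((r - lv) ** 2 for r in left) + sum((r - rv) ** 2 for r in right)
        if best is None or err < best[0]:
            best = (err, t, lv, rv)
    return best[1:]  # (threshold, left_value, right_value)


def predict_stump(stump, x):
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


def gradient_boost(xs, ys, n_models=20, learning_rate=0.5):
    """Fit stumps one after another to the residuals of the current ensemble."""
    base = sum(ys) / len(ys)  # initial constant model
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_models):  # stop criterion: fixed number of models
        residuals = [y - p for y, p in zip(ys, preds)]  # negative gradient
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, x)
                 for p, x in zip(preds, xs)]
    return base, learning_rate, stumps


def predict(model, x):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(predict_stump(s, x) for s in stumps)


def mean_squared_error(model, xs, ys):
    return sum((y - predict(model, x)) ** 2 for x, y in zip(xs, ys)) / len(ys)


# Toy data (hypothetical): one numeric feature, 0/1 target.
xs = [1, 2, 3, 4, 5, 6]
ys = [0, 0, 0, 1, 1, 1]
model = gradient_boost(xs, ys)
```

On this toy data the training error shrinks as more stumps are added, which is also why a stop criterion is needed on real data to avoid overfitting.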


GBDT [20] is an ensemble model of sequentially trained decision trees. In each iteration, GBDT
trains a decision tree by fitting the negative gradients (also called residual errors).
Learning the decision trees is the most time-consuming part of GBDT, and finding the best split
points is the most time-consuming part of learning a decision tree. The pre-sort algorithm,
which enumerates all possible split points on pre-sorted feature values, is one of the most
widely used split-finding methods. This method is simple and can find the optimal splits, but
it is inefficient in terms of training speed and memory usage. 
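
A minimal sketch of the pre-sort idea for a single feature is given below: the samples are sorted once by feature value, and every boundary between distinct values is then scanned while a running sum is maintained. The gain used to score a split (reduction in the sum of squared errors of the gradients) is a common choice but is an assumption here, since the text does not fix a scoring function; the function name is hypothetical.

```python
def best_split_presorted(values, gradients):
    """Scan all split points of one feature using a pre-sorted order.

    Returns (threshold, gain), where gain is the reduction in the sum of
    squared errors of `gradients` obtained by splitting at `threshold`.
    Returns (None, 0.0) if no split improves on the unsplit node.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])  # pre-sort once
    total = sum(gradients)
    left_sum, best_gain, best_threshold = 0.0, 0.0, None
    for pos in range(n - 1):
        i, nxt = order[pos], order[pos + 1]
        left_sum += gradients[i]
        if values[i] == values[nxt]:
            continue  # cannot split between equal feature values
        n_left = pos + 1
        right_sum = total - left_sum
        # SSE reduction = S_L^2 / n_L + S_R^2 / n_R - S^2 / n
        gain = (left_sum ** 2 / n_left
                + right_sum ** 2 / (n - n_left)
                - total ** 2 / n)
        if gain > best_gain:
            best_gain = gain
            best_threshold = (values[i] + values[nxt]) / 2
    return best_threshold, best_gain


# Toy data (hypothetical): low feature values have negative gradients.
values = [5, 1, 3, 2, 4, 6]
gradients = [0.5, -0.5, -0.5, -0.5, 0.5, 0.5]
threshold, gain = best_split_presorted(values, gradients)
```

The sorted order must be stored for every feature, which illustrates the memory cost mentioned above.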


3.4. Bagging Classifier 


Bagging methods are a powerful category of algorithms that build many instances of a black-box
estimator on random subsets of the original data set and then aggregate their individual
predictions to form the final prediction. Bagging methods deliberately incorporate
randomisation into their construction in order to reduce the variance of the base estimator.
Suppose you have a learner, for example a decision tree. You can improve its accuracy and
reduce its variance with the bootstrap technique: 


• Generate numerous samples of your data set to serve as training sets, using the following
scheme: draw an element at random from your training set and then put it back (sampling with
replacement). As a result, some training elements are present several times in a new sample,
while others are missing by chance. Each sample must be the same size as the original
training set. 


• Train the learner on each generated sample. 


• Aggregate the learners: average their outputs for regression, or take a vote for
classification. 
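
The three steps above can be sketched as follows. This is a minimal illustration that assumes a hypothetical `train` callable, which takes a list of `(features, label)` pairs and returns a prediction function; the names `bootstrap_sample`, `bagging_fit`, `bagging_predict`, and `train_majority` are illustrative only.

```python
import random
from collections import Counter


def bootstrap_sample(data, rng):
    """Draw len(data) elements from data with replacement."""
    return [rng.choice(data) for _ in data]


def bagging_fit(data, train, n_estimators=10, seed=0):
    """Train one model per bootstrap sample."""
    rng = random.Random(seed)
    return [train(bootstrap_sample(data, rng)) for _ in range(n_estimators)]


def bagging_predict(models, x, task="classification"):
    """Aggregate the models: average for regression, majority vote otherwise."""
    outputs = [model(x) for model in models]
    if task == "regression":
        return sum(outputs) / len(outputs)
    return Counter(outputs).most_common(1)[0][0]


def train_majority(sample):
    """Toy base learner: always predicts the most common label in its sample."""
    label = Counter(y for _, y in sample).most_common(1)[0][0]
    return lambda x: label


# Toy data (hypothetical): every record carries the same label.
data = [((i,), "stay") for i in range(10)]
models = bagging_fit(data, train_majority, n_estimators=5)
```

In practice the base learner would be a real model such as a decision tree; the toy learner here only keeps the sketch self-contained.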


3.5. Random Forest Classifier 


Random Forest is a supervised learning technique used for classification and regression. It is
an ensemble technique that combines a number of weak models to create a powerful model. The
random forest generates several trees. To classify an instance, each tree votes for a class,
and the forest chooses the classification with the most votes. Figure 4 depicts the random
forest prediction process [22]. 


• Take the test features and use each tree to predict an outcome, storing the predicted
results. 


• Count the votes for each predicted outcome. 


• Take the prediction with the most votes as the final prediction. 

Figure 4 Prediction process taken by random forest 


3.6. Decision Tree Classifier 


The decision tree algorithm [23] is a supervised learning method for classification and
regression problems. Its main purpose is to create a training model that can be used to predict
employee attrition from previously collected data sets. It tackles the problem through a
hierarchy of nodes. There are three types of node: 


• Root 
• Internal Nodes 
• Leaf Nodes 


The root node represents the complete sample, which is then split through internal nodes that
test the attributes, ending in leaf nodes that represent the class labels. 
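
The node hierarchy can be illustrated with a small hand-built tree. This is only a sketch: the attribute names are borrowed from the dataset, but the thresholds and the tree shape are hypothetical and were not learned from data.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    label: str  # class label stored at a leaf node


@dataclass
class Node:
    feature: str        # attribute tested at this node
    threshold: float    # go left if value <= threshold, otherwise right
    left: Union["Node", Leaf]
    right: Union["Node", Leaf]


def predict(node, sample):
    """Walk from the root through internal nodes until a leaf is reached."""
    while isinstance(node, Node):
        if sample[node.feature] <= node.threshold:
            node = node.left
        else:
            node = node.right
    return node.label


# Root node tests DistanceFromHome; one internal node tests YearsAtCompany.
tree = Node(
    "DistanceFromHome", 10,
    Leaf("No"),
    Node("YearsAtCompany", 2, Leaf("Yes"), Leaf("No")),
)
```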





Figure 5 Representation of the Decision Tree 


3.7. Proposed Research Methodology with Feature Selection Technique 


This paper presents a Chi-Square-based feature selection algorithm for predicting employee
attrition with machine learning algorithms. Other feature selection approaches for the
classification of employee attrition are compared with Chi-Square. The feature selection
methods used to predict employee attrition are correlation-based feature selection, information
gain, the gain ratio, chi-square-based feature selection, and Fisher's Exact Test. 

Figure 6 Proposed Research Methodology for Predicting Employee Attrition in Industry 


3.7.1. Correlation based Feature Selection 


In this algorithm, the relationship between each feature and the output class is determined,
and a heuristic correlation evaluation function is used to choose the best feature subset [24].
It examines the relationship between nominal or categorical features, with numerical features
being discretised. The merit of a feature subset is given by the following equation. 


$r_{zc} = \frac{K\,\overline{r_{zi}}}{\sqrt{K + K(K-1)\,\overline{r_{ii}}}}$


where $r_{zc}$ denotes the correlation between the feature subset and the class variable, $K$
is the number of features, $\overline{r_{zi}}$ is the mean feature-class correlation, and
$\overline{r_{ii}}$ is the mean feature-feature inter-correlation. 
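
As a small worked illustration of the merit function, the sketch below evaluates it for given mean correlations. The function and parameter names are illustrative, and the correlation values in the usage lines are made-up numbers chosen only to show the effect of redundancy.

```python
import math


def cfs_merit(k, mean_feature_class_corr, mean_feature_feature_corr):
    """CFS merit: K * r_zi / sqrt(K + K * (K - 1) * r_ii)."""
    return (k * mean_feature_class_corr) / math.sqrt(
        k + k * (k - 1) * mean_feature_feature_corr
    )


# Four features, each correlated 0.5 with the class (hypothetical values).
uncorrelated = cfs_merit(4, 0.5, 0.0)  # features carry independent information
redundant = cfs_merit(4, 0.5, 1.0)     # features duplicate one another
```

With the same feature-class correlation, the merit falls from 1.0 to 0.5 when the features are fully inter-correlated, which is how the function penalises redundant subsets.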


3.7.2. Information Gain Feature Selection 


Information gain measures the amount of information that a term contributes to class
prediction. It determines the relevant information value of an attribute with respect to the
class attribute. Entropy-based indexes are frequently used to evaluate attributes. The goal is
to calculate the information gain, via entropy values, for every attribute [25]. It is a
supervised, univariate, simple, robust, symmetrical, and entropy-based feature selection
approach. For the variables X and Y, the information gain is given by: 


https:/Aaeme.com/Home/journal/IJARET editor@iaeme.com 


Employee Attrition Prediction in Industry Using Machine Learning Techniques 
$\mathrm{InformationGain}(X, Y) = H(X) - H(X \mid Y)$


where $H(X)$ is the entropy of X and $H(X \mid Y)$ is the conditional entropy of X given Y.
The entropy of X is computed as 


$H(X) = -\sum_{i} P(x_i)\log_2 P(x_i)$


The conditional entropy of X given Y is computed as 


$H(X \mid Y) = -\sum_{j} P(y_j) \sum_{i} P(x_i \mid y_j)\log_2 P(x_i \mid y_j)$


This method computes the information gain of each feature independently and selects the top
'm' features, i.e. it treats the features F with the highest information gain as the most
relevant. The fundamental disadvantage of this technique is that it may select an attribute
with high information gain that is not actually more informative. Because each feature is
scored on its own, information gain is unable to handle redundant features. 
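
The entropy and information gain calculations can be sketched directly from their definitions, estimating the probabilities by relative frequencies. The function names and the toy values are illustrative assumptions.

```python
import math
from collections import Counter


def entropy(values):
    """H(X) = -sum_i P(x_i) * log2 P(x_i), with P estimated by frequency."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def conditional_entropy(xs, ys):
    """H(X|Y) = sum_j P(y_j) * H(X | Y = y_j)."""
    n = len(xs)
    groups = {}
    for x, y in zip(xs, ys):
        groups.setdefault(y, []).append(x)
    return sum(len(group) / n * entropy(group) for group in groups.values())


def information_gain(xs, ys):
    """InformationGain(X, Y) = H(X) - H(X|Y)."""
    return entropy(xs) - conditional_entropy(xs, ys)


# Toy data (hypothetical): X is the attrition label, Y is a candidate feature.
attrition = ["yes", "yes", "no", "no"]
informative = ["a", "a", "b", "b"]    # determines the label completely
uninformative = ["a", "b", "a", "b"]  # tells nothing about the label
```

Here `information_gain(attrition, informative)` evaluates to 1.0 bit and `information_gain(attrition, uninformative)` to 0.0, so a ranking by information gain would keep the first feature and discard the second.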


3.7.3. Gain Ratio Feature Selection 


Gain Ratio is a modified version of information gain. It takes into account the child nodes
into which an attribute splits the class data. This reduces the bias of information gain
towards attributes with a large number of possible values [26]. The gain ratio is given by: 


$\mathrm{GainRatio} = \frac{\mathrm{InformationGain}}{H(X)}$


We normalise information gain by dividing it by the entropy of X when predicting variable Y,
and vice versa. As a result of this normalisation, the gain ratio has a value between 0 and 1.
A gain ratio of 1 means that X completely predicts Y, and a gain ratio of 0 means that there is
no relationship between Y and X. Compared with information gain, it favours variables with
fewer values. 
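
A minimal sketch of this normalisation follows, using the same convention as the formula in this section (information gain divided by the entropy of X). The entropy helper is repeated so that the block runs on its own; the function names and toy values are illustrative assumptions.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy in bits, with probabilities estimated by frequency."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def gain_ratio(xs, ys):
    """(H(X) - H(X|Y)) / H(X); defined as 0.0 when X has zero entropy."""
    h_x = entropy(xs)
    if h_x == 0:
        return 0.0
    n = len(xs)
    groups = {}
    for x, y in zip(xs, ys):
        groups.setdefault(y, []).append(x)
    h_x_given_y = sum(len(g) / n * entropy(g) for g in groups.values())
    return (h_x - h_x_given_y) / h_x


# Toy data (hypothetical).
full = gain_ratio(["a", "a", "b", "b"], ["p", "p", "q", "q"])     # 1.0
none = gain_ratio(["a", "b", "a", "b"], ["p", "p", "q", "q"])     # 0.0
partial = gain_ratio(["a", "a", "a", "b"], ["p", "p", "q", "q"])  # between 0 and 1
```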


3.7.4. Chi-Square Feature Selection Method 


Chi-Square feature selection is based on two quantities: the observed and the expected
frequency [27]. The weights of the attributes can also be derived from it; the attributes with
the greatest weights correspond to the relevant attributes. The method tests each attribute
against the class label, and the predictor variables are chosen on this basis. For an attribute
with 'r' values and 'c' classes, the statistic is defined as 


$\chi^2 = \sum_{i=1}^{r}\sum_{j=1}^{c} \frac{(O_{ij} - E_{ij})^2}{E_{ij}}$


where $O_{ij}$ is the observed number of instances with value 'i' in class 'j', and $E_{ij}$ is
the expected number of instances with value 'i' and class 'j'. 
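
The Chi-Square score of a single categorical attribute can be sketched as below, assuming the usual Pearson form in which the expected count of a cell is the product of its row and column totals divided by the number of instances. The function name and the toy records are hypothetical.

```python
from collections import Counter


def chi_square(feature_values, class_labels):
    """Sum over all (value, class) cells of (observed - expected)^2 / expected."""
    n = len(feature_values)
    observed = Counter(zip(feature_values, class_labels))
    value_totals = Counter(feature_values)  # row totals
    class_totals = Counter(class_labels)    # column totals
    statistic = 0.0
    for value, value_count in value_totals.items():
        for label, label_count in class_totals.items():
            expected = value_count * label_count / n
            statistic += (observed[(value, label)] - expected) ** 2 / expected
    return statistic


# Toy data (hypothetical): eight employees, two candidate attributes.
labels = ["leave"] * 4 + ["stay"] * 4
associated = ["yes"] * 4 + ["no"] * 4    # matches the class exactly
independent = ["yes", "no"] * 4          # evenly spread over both classes
```

Here `chi_square(associated, labels)` gives 8.0 and `chi_square(independent, labels)` gives 0.0, so the first attribute would receive the higher weight.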
3.7.5. Fisher's Exact Test 


Using the well-known Fisher ratio concept and a heuristic approach to assign a score to each
attribute, the Fisher score can be used to choose the suitable attributes [27]. The benefits
are: 


• Identifies the features relevant to a particular problem. 
• Reduces the problem size and the storage required on the machine. 
• Minimises computation time and also increases prediction accuracy. 
• Strengthens the classification by eliminating irrelevant features and noise. 


4. RESULT AND DISCUSSION 


4.1. Description of the Dataset 


The employee attrition dataset is used to evaluate the performance of the proposed framework
with various feature selection methods and classifiers. The dataset is taken from the Kaggle
repository [28]. Table 1 lists the attributes of the dataset used in this research work. 


Table 1 IBM Employee Attrition Dataset 


No.   Feature 
6     DistanceFromHome 
8     Education Field 
9     EmployeeCount 
26    Relationship Satisfaction 
27    Standard Hours 
28    Stock Option Level 
29    Total Working Years 
30    Training time last year 
31    Work life balance 
32    Years at Company 
33    Years in current role 
34    Years since last promotion 
35    Years with current manager 
      Job Role 
4.2. Number of Features obtained by Feature Selection Methods 


Table 2 gives the number of features selected by each feature selection technique:
Correlation based Feature Selection (CFS), Information Gain (IG), Gain Ratio (GR), Chi-Square,
and Fisher Exact Test. From Table 2, it is clear that the Chi-Square Feature Selection method
yields fewer features than the other feature selection methods. 


Table 2 Number of Features obtained by Feature Selection Methods 


Gain Ratio 
Chi-Square 
Fisher Exact Test 


4.3. Performance Analysis of the Feature Selection Methods 


The performance metrics Classification Accuracy, True Positive Rate (TPR), Precision, False
Positive Rate (FPR), and Miss Rate are considered in this paper to evaluate the performance of
the feature selection methods in the prediction of employee attrition in industry using
different Machine Learning classifiers. 
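
For reference, these five metrics can be computed from the four cells of a binary confusion matrix using their standard definitions. The sketch below returns fractions (multiply by 100 for percentages); the function name and the counts in the usage line are made-up values and are not results from this study.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard metrics from true/false positive and negative counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "tpr": tp / (tp + fn),        # true positive rate (recall)
        "precision": tp / (tp + fp),
        "fpr": fp / (fp + tn),        # false positive rate
        "miss_rate": fn / (tp + fn),  # false negative rate = 1 - TPR
    }


# Hypothetical counts for illustration only.
metrics = classification_metrics(tp=40, fp=10, tn=45, fn=5)
```

Because the miss rate is the complement of the TPR, a method that raises the TPR lowers the miss rate by the same amount.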


Table 3 gives the classification accuracy obtained by the feature selection methods using 
various classifiers. Table 4 depicts the True Positive Rate (in %) obtained by Feature Selection 
Methods using different classifiers. Table 5 presents the precision (in %) obtained by the feature 
selection methods using various classifiers. Table 6 depicts the False Positive Rate (in %) 
obtained by the feature selection methods using various classifiers. Table 7 gives the miss rate 
(in %) obtained by the feature selection methods using various classifiers. 


Table 3 Classification accuracy (in %) obtained by Feature Selection Methods using different
classifiers 


Table 4 True Positive Rate (in %) obtained by Feature Selection Methods using different classifiers 


Table 5 Precision (in %) obtained by Feature Selection Methods using different classifiers 


Table 6 False Positive Rate (in %) obtained by Feature Selection Methods using different classifiers 


Table 7 Miss Rate (in %) obtained by Feature Selection Methods using different classifiers 

From Tables 3 to 7, it is clear that Chi-Square Feature Selection with the Gradient Boosting
Tree (GBT) classifier increased the classification accuracy, TPR, and Precision, and also
reduced the error rates FPR and Miss Rate, for predicting employee attrition in industry. 











5. CONCLUSION 


Employee attrition prediction has become a key issue in today's organisations. Employee
attrition is a major problem for businesses, especially when trained, technical, and critical
staff leave for better opportunities elsewhere. This results in a financial loss, as a trained
employee must be replaced. Feature selection is critical in the pre-processing stage of data
mining, as many machine learning approaches struggle to manage large numbers of irrelevant
features. Various feature selection strategies are used in this research article to improve
the accuracy of employee attrition prediction in industry. The performance of six classifiers,
namely ANN, SVM, GBT, Bagging, RF, and DT, is tested for the prediction of employee attrition
using the feature selection approaches Information Gain, Gain Ratio, Chi-Square,
Correlation-based, and Fisher's Exact Test. The results show that Chi-Square Feature Selection
with the Gradient Boosting Tree classifier performs better in the prediction of employee
attrition than the other feature selection techniques with the other classifiers. 




REFERENCES 


[1] Gopinath, R. "Impact of Stress Management by development of Emotional Intelligence in
CMTS, BSNL, Tamilnadu Circle-A Study." International Journal of Management Research
and Development (IJMRD) 4.1 (2014). 

[2] Gopinath, R. "Prominence of Self-Actualization in Organization." International Journal of
Advanced Science and Technology 29.3 (2020): 11591-11602. 

[3] Gopinath, R. (2019). Quality of Work Life (QWL) among the Employees of LIC, International
Journal of Scientific Research and Review, 8(5), 373-377. 

[4] Gopinath, R. "Relationship Between Knowledge Management and Human Resource
Development-A Study on Telecommunication Industry." Suraj Punj Journal for
Multidisciplinary Research 9.5 (2019): 477-480. 

[5] Gopinath, R., & Kalpana, R. (2020). Relationship of Job Involvement with Job Satisfaction.
Adalya Journal, 9 (7), 306-315. 

[6] Fan, Chin-Yuan, et al. "Using hybrid data mining and machine learning clustering analysis to
predict the turnover rate for technology professionals." Expert Systems with Applications 39.10
(2012): 8844-8851. 

[7] Samuel, Michael O., and Crispen Chipunza. "Employee retention and turnover: Using
motivational variables as a panacea." African journal of business management 3.9 (2009):
410-415. 

[8] Samuel, Michael O., and Crispen Chipunza. "Employee retention and turnover: Using
motivational variables as a panacea." African journal of business management 3.9 (2009):
410-415. 

[9] Glebbeek, Arie C., and Erik H. Bax. "Is high employee turnover really harmful? An empirical
test using company records." Academy of management journal 47.2 (2004): 277-286. 

[10] Allen, David G. Retaining talent: A guide to analysing and managing employee turnover.
SHRM Foundations, 2008. 

[11] Tang, Ziyuan, Gautam Srivastava, and Shuai Liu. "Swarm intelligence and ant colony
optimization in accounting model choices." Journal of Intelligent & Fuzzy Systems Preprint
(2020): 1-9. 

[12] Marichelvam, M. K., M. Geetha, and Ömür Tosun. "An improved particle swarm optimization
algorithm to solve hybrid flowshop scheduling problems with the effect of human factors-A
case study." Computers & Operations Research 114 (2020): 104812. 

[13] Jhaver, Mehul, Yogesh Gupta, and Amit Kumar Mishra. "Employee Turnover Prediction
System." 2019 4th International Conference on Information Systems and Computer Networks
(ISCON). IEEE, 2019. 

[14] Machado, Marcos Roberto, Salma Karray, and Ivaldo Tributino de Sousa. "LightGBM: An
effective decision tree gradient boosting method to predict customer loyalty in the finance
industry." 2019 14th International Conference on Computer Science & Education
(ICCSE). IEEE, 2019. 



M. Subhashini and R. Gopinath 


[15] Keshri, Rajat, and P. Srividya. "Prediction of Employee Turnover Using Light GBM
Algorithm." 

[16] Padmasini, Ms, and K. Shyamala. "An Integrated Gower based PSO-K Mode Clustering Model
for Business Solutions through Existing Customer Assessment." 

[17] Eitle, Verena, and Peter Buxmann. "Business analytics for sales pipeline management in the
software industry: a machine learning perspective." Proceedings of the 52nd Hawaii
International Conference on System Sciences. 2019. 

[18] Dutta, Shawni, and Samir Kumar Bandyopadhyay. "Employee attrition prediction using neural
network cross validation method." International Journal of Commerce and Management
Research (2020). 

[19] Kim, Soo Y. "Prediction of hotel bankruptcy using support vector machine, artificial neural
network, logistic regression, and multivariate discriminant analysis." The Service Industries
Journal 31.3 (2011): 441-468. 

[20] Qutub, Aseel, et al. "Prediction of Employee Attrition Using Machine Learning and Ensemble
Methods." Int. J. Mach. Learn. Comput 11 (2021). 

[21] Bhuva, Kashyap, and Kriti Srivastava. "Comparative Study of the Machine Learning
Techniques for Predicting the Employee Attrition." IJRAR-International Journal of Research
and Analytical Reviews (IJRAR) 5.3 (2018): 568-577. 

[22] Sisodia, Dilip Singh, Somdutta Vishwakarma, and Abinash Pujahari. "Evaluation of machine
learning models for employee churn prediction." 2017 International Conference on Inventive
Computing and Informatics (ICICI). IEEE, 2017. 

[23] Alao, D. A. B. A., and A. B. Adeyemo. "Analyzing employee attrition using decision tree
algorithms." Computing, Information Systems, Development Informatics and Allied Research
Journal 4.1 (2013): 17-28. 

[24] Najafi-Zangeneh, Saeed, et al. "An Improved Machine Learning-Based Employees Attrition
Prediction Framework with Emphasis on Feature Selection." Mathematics 9.11 (2021): 1226. 

[25] Jain, Divyang. Evaluation of Employee Attrition by Effective Feature Selection using Hybrid
Model of Ensemble Methods. Diss. Dublin, National College of Ireland, 2017. 

[26] Ozdemir, Fatma. Recommender System For Employee Attrition Prediction And Movie
Suggestion. Diss. Abdullah Gul University, 2020. 

[27] PM, Usha, and N. V. Balaji. "Chi Square Selector Enhanced Fuzzy Clustering Method for
Employee Attrition Prediction." Design Engineering (2021): 3405-3425. 

[28] https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset 

[29] Gopinath, R., and Chitra, A. (2020) Emotional Intelligence and Job Satisfaction of Employees
at Sago Companies in Salem District: Relationship Study. Adalya Journal, 9 (6), pp. 203-217. 

[30] Gopinath, R., and N. S. Shibu. "A study on few HRD related entities influencing Job
Satisfaction in BSNL, Tamil Nadu Telecom Circle." Annamalai Business Review, Special Issue
(2015): 24-30. 




